Soft-decision a Priori Knowledge Interpolation for Robust Telephone Speaker Identification
نویسندگان
چکیده
Handsets which are not seen in the training phase (a.k.a unseen handsets) are main sources of performance degradation for speaker identification (SID) applications in telecommunication environments. To alleviate the problem, a soft-decision a priori knowledge interpolation (SD-AKI) method of handset characteristic estimation for handset mismatch-compensated SID is proposed in this paper. The idea of the SD-AKI method is to first collect a set of characteristics of seen handsets in the training phase, and to then estimate the characteristic of the unknown testing handset by interpolating the set of seen handset characteristics in the test phase. The estimated handset characteristic is then used to compensate for handset mismatch for robust SID. The SD-AKI method can be realized in both feature and model spaces. Experimental results on the handset TIMIT (HTIMIT) database showed that both the proposed featureand model-space SD-AKI schemes were more robust than the blind cepstral mean subtraction (CMS), feature warping (FW) methods and their hard-decision counterpart (HD-AKI) for both cases of all-handset and unseen-handset SID tests. It is therefore a promising robust SID method.
منابع مشابه
Unseen handset mismatch compensation based on a priori knowledge interpolation for robust speaker recognition
Unseen handset mismatch is the major source of performance degradation for speaker recognition in telecommunication environment since handset distortions are tightly coupled with speaker characteristics. In this paper, a soft-decision unseen handset characteristics estimation method based on a priori knowledge interpolation is proposed to decouple the characteristics of the unseen handset and s...
متن کاملA robust aggregation operator for multi-criteria decision-making method with bipolar fuzzy soft environment
Molodtsov initiated soft set theory that provided a general mathematicalframework for handling with uncertainties in which we encounter the data by affix parameterized factor during the information analysis as differentiated to fuzzy as well as bipolar fuzzy set theory.The main object of this paper is to lay a foundation for providing a new application of bipolar fuzzy soft tool in ...
متن کاملRapid speaker adaptation by reference model interpolation
We present in this work a novel algorithm for fast speaker adaptation using only small amounts of adaptation data. It is motivated by the fact that a set of representative speakers can provide a priori knowledge to guide the estimation of a new speaker in the speaker-space. The proposed algorithm enables an a posteriori selection of reference models in the speakerspace as opposed to the a prior...
متن کاملRobust text-independent speaker identification using Gaussian mixture speaker models
This paper introduces and motivates the use of Gaussian mixture models (CMM) for robust text-independent speaker identification. The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are efTective for modeling speaker identity. The focus of this work is on applications which require high identification rates using short utterance ...
متن کاملEffect of Decision Rule on Speaker Recognition Performance
Speaker recognition from speech signal is still an ongoing research in forensics and biometrics area. Speaker recognition is the process to enable machine to recognize speaker's identity from their speech. The applications of speaker recognition technologies include access control system, security control for confidential information, and telephone banking. As a subset of speaker recognition, s...
متن کامل